Skip to main content
Log in

Does cumulative advantage affect collective learning in science? An agent-based simulation

  • Published:
Scientometrics Aims and scope Submit manuscript

Abstract

Agent-based simulation can model simple micro-level mechanisms capable of generating macro-level patterns, such as frequency distributions and network structures found in bibliometric data. Agent-based simulations of organisational learning have provided analogies for collective problem solving by boundedly rational agents employing heuristics. This paper brings these two areas together in one model of knowledge seeking through scientific publication. It describes a computer simulation in which academic papers are generated with authors, references, contents, and an extrinsic value, and must pass through peer review to become published. We demonstrate that the model can fit bibliometric data for a token journal, Research Policy. Different practices for generating authors and references produce different distributions of papers per author and citations per paper, including the scale-free distributions typical of cumulative advantage processes. We also demonstrate the model’s ability to simulate collective learning or problem solving, for which we use Kauffman’s NK fitness landscape. The model provides evidence that those practices leading to cumulative advantage in citations, that is, papers with many citations becoming even more cited, do not improve scientists’ ability to find good solutions to scientific problems, compared to those practices that ignore past citations. By contrast, what does make a difference is referring only to publications that have successfully passed peer review. Citation practice is one of many issues that a simulation model of science can address when the data-rich literature on scientometrics is connected to the analogy-rich literature on organisations and heuristic search.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Notes

  1. The simulation model, CitationAgents1, was developed initially in VBA within Excel 2003, and then, after a break of several months, reproduced using NetLogo 4.1. Replicating a simulation model in this way helps to verify that the program is working as intended. The extra work involved in replicating the model was worthwhile, as several minor errors in the original version were exposed. A version of it may be downloaded from OpenABM: http://www.openabm.org/model/2470.

  2. The exact relation between computing time and model scale is difficult to state, and differs between the VBA and NetLogo versions, for reasons internal to those software environments.

  3. The options for generating author lists also include those that produce cumulative advantage (in papers per author), but preliminary testing showed that these mechanisms have smaller effects than those for generating references, perhaps because papers have more references than authors on average.

References

  • Ahrweiler, P., & Gilbert, G. N. (2005). Caffè Nero: The evaluation of social simulation. Journal of Artificial Societies and Social Simulation, 8(4), 14. Retrieved 26, Feb 2010 from http://jasss.soc.surrey.ac.uk/8/4/14.html.

  • Axelrod, R. (1997). The complexity of cooperation: agent-based models of competition and collaboration. Princeton: Princeton University Press.

    Google Scholar 

  • Barabási, A.-L., & Albert, R. (1999). Emergence of scaling in random networks. Science, 286, 509–512.

    Article  MathSciNet  Google Scholar 

  • Bentley, R. A., Ormerod, P., & Batty, M. (2009). An evolutionary model of long tailed distributions in the social sciences. arXiv:0903.2533v1 [physics.soc-ph] March 14, 2009. Retrieved 1, May 2010 from http://arxiv.org/PS_cache/arxiv/pdf/0903/0903.2533v1.pdf.

  • Boerner, K., Maru, J. T., & Goldstone, R. L. (2004). The simultaneous evolution of author and paper networks. Proceedings of the National Academy of Science USA, 101(suppl. 1), S266–S273.

    Google Scholar 

  • Bornmann, L. (2010). Does the journal peer review select the ‘best’ from the work submitted? The state of empirical research. IETE Technical Review, 27(2), 93–96.

    Article  Google Scholar 

  • Burrell, Q. L. (2001). Stochastic modelling of the first-citation distribution. Scientometrics, 52(1), 3–12.

    Article  Google Scholar 

  • Burrell, Q. L. (2007). Hirsch’s h-index: a stochastic model. Journal of Informetrics, 1, 16–25.

    Article  Google Scholar 

  • Burt, R. (2005). Brokerage and closure: an introduction to social capital. Oxford: Oxford Univerity Press.

    Google Scholar 

  • Castellano, C., Marsili, M., & Vespignani, A. (2000). Nonequilibrium phase transition in a model for social influence. Physical Review Letters, 85(16), 3536–3539.

    Article  Google Scholar 

  • Chen, C., Chen, Y., Horowitz, H., Hou, H., Liu, Z., & Pellegrino, D. (2009). Towards an explanatory and computational theory of scientific discovery. Journal of Informetrics, 3, 191–209.

    Article  Google Scholar 

  • Clerc, M. (2006). Particle swarm optimisation. London: ISTE.

    Book  Google Scholar 

  • Clerc, M., & Kennedy, J. (2002). The particle swarm—explosion, stability, and convergence in a multidimensional complex space. IEEE Transactions on Evolutionary Computation, 6(1), 58–73.

    Article  Google Scholar 

  • Collins, R. (1998). The sociology of philosophies: a global theory of intellectual change. London: Belknap Press, Harvard University Press.

    Google Scholar 

  • Corne, D., Dorigo, M., & Glover, F. (Eds.). (1999). New ideas in optimisation. London: McGraw-Hill.

    Google Scholar 

  • Cyert, R. M., & March, J. G. (1963). A behavioral theory of the firm. Englewood Cliffs: Prentice-Hall.

    Google Scholar 

  • Fuller, S. (2000). The governance of science. Buckingham: Open University Press.

    Google Scholar 

  • Gilbert, N. (1997). A simulation of the structure of academic science. Sociological Research Online, 2(2), 3. Retrieved 10, Feb 2009 from http://www.socresonline.org.uk/socresonline/2/2/3.html.

  • Gilbert, G. N., & Troitzsch, K. G. (2005). Simulation for the social scientist. Maidenhead: Open University Press.

    Google Scholar 

  • Glaenzel, W., & Schubert, A. (1990). The cumulative advantage function. A mathematical formulation based on conditional expectations and its application to scientometric distributions. Informetrics, 89/90, 139–147.

    Google Scholar 

  • Glaenzel, W., & Schubert, A. (1995). Predictive aspects of a stochastic model for citation processes. Information Processing and Management, 31(1), 69–80.

    Article  Google Scholar 

  • Glover, F. (1989). Tabu Search—Part I. ORSA Journal on Computing, 1(3), 190–206.

    MATH  Google Scholar 

  • Glover, F. (1990). Tabu search—Part II. ORSA Journal on Computing, 2(1), 4–32.

    MATH  Google Scholar 

  • Hegselmann, R., & Krause, U. (2002). Opinion dynamics and bounded confidence models, analysis, and simulation. Journal of Artificial Societies and Social Simulation, 5(3), 2. Retrieved 28, Sep 2009 from http://jasss.soc.surrey.ac.uk/5/3/2.html.

  • Jin, Y., & Branke, J. (2005). Evolutionary optimization in uncertain environments–a survey. IEEE Transactions on Evolutionary Computation, 9(3), 303–317.

    Article  Google Scholar 

  • Kahneman, D., Slovic, P., & Tversky, A. (1982). Judgment under uncertainty: Heuristics and biases. Cambridge: Cambridge University Press.

    Google Scholar 

  • Kauffman, S. (1993). The origins of order: self-organization and selection in evolution. New York: Oxford University Press.

    Google Scholar 

  • Kauffman, S. (1995). At home in the universe: the search for laws of complexity. London: Penguin.

    Google Scholar 

  • Kauffman, S. (2000). Investigations. Oxford: Oxford University Press.

    Google Scholar 

  • Kirkpatrick, S., Gelatt, C. D., & Vecchi, M. P. (1983). Optimization by simulated annealing. Science, 220, 671–680.

    Article  MathSciNet  Google Scholar 

  • Latour, B. (1987). Science in action. Cambridge: Harvard University Press.

    Google Scholar 

  • Latour, B. (1996). Aramis or the love of technology. Cambridge: Harvard University Press.

    Google Scholar 

  • Law, J., & Callon, M. (1992). The life and death of an aircraft: A network analysis of technical change. In W. Bijker & J. Law (Eds.), Shaping technology/building society: Studies in sociotechnical change (pp. 21–52). Cambridge: The MIT Press.

    Google Scholar 

  • Lazer, D., & Friedman, A. (2007). The network structure of exploration and exploitation. Administrative Science Quarterly, 52, 667–694.

    Article  Google Scholar 

  • Levinthal, D. A. (1997). Adaptation on rugged landscapes. Management Science, 43(7), 934–950.

    Article  MATH  Google Scholar 

  • Levinthal, D. A., & Warglien, M. (1999). Landscape design: Designing for local action in complex worlds. Organization Science, 10(3), 342–357.

    Article  Google Scholar 

  • Lotka, A. J. (1926). The frequency distribution of scientific productivity. Journal of the Washington Academy of Sciences, 16, 317–323.

    Google Scholar 

  • March, J. G. (1991). Exploration and exploitation in organisational learning. Organization Science, 2(1), 71–87.

    Article  MathSciNet  Google Scholar 

  • March, J. G., & Simon, H. A. (1958). Organizations. New York: Wiley.

    Google Scholar 

  • McKelvey, B. (1999). Avoiding complexity catastrophe in coevolutionary pockets: Strategies for rugged landscapes. Organization Science, 10(3), 294–321.

    Article  Google Scholar 

  • McPherson, M., Smith-Lovin, L., & Cook, J. M. (2001). Birds of a feather: Homophily in social networks. Annual Review of Sociology, 27, 415–444.

    Article  Google Scholar 

  • Merton, R. K. (1968). The Matthew effect in science. Science, 159(3810), 56–63.

    Article  Google Scholar 

  • Merton, R. K. (1988). The Matthew effect in science, II. Cumulative advantage and the symbolism of intellectual property. ISIS, 79, 606–623.

    Article  Google Scholar 

  • Mitchell, M. (1996). An introduction to genetic algorithms. Cambridge: MIT Press.

    Google Scholar 

  • Mulkay, M. J., Gilbert, G. N., & Woolgar, S. (1975). Problem areas and research networks in science. Sociology, 9, 187–203.

    Article  Google Scholar 

  • Newman, M. E. J. (2001a). Scientific collaboration networks. I. Network construction and fundamental results. Physical Review E, 64, 016131.

    Article  Google Scholar 

  • Newman, M. E. J. (2001b). Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Physical Review E, 64, 016132.

    Article  Google Scholar 

  • Newman, M. E. J. (2001c). The structure of scientific collaboration networks. Proceedings of the National Academy of Science USA, 98(2), 404–409.

    Article  MATH  Google Scholar 

  • Newman, M. E. J. (2003). The structure and function of complex networks. SIAM Review, 45, 167–256.

    Article  MathSciNet  MATH  Google Scholar 

  • Price, D. J. de S. (1963). Little science, big science. New York: Columbia University Press.

  • Price, D. J. de S. (1965). Networks of scientific papers. Science, 149(3683), 510–515.

  • Price, D. J. de S. (1976). A general theory of bibliometric and other cumulative advantage processes. Journal of the American Society for Information Science, 27, 292–306.

  • Redner, S. (1998). How popular is your paper? An empirical study of the citation distribution. The European Physical Journal B, 4, 131–134.

    Article  Google Scholar 

  • Sandstrom, P. E. (1999). Scholars as subsistence foragers. Bulletin of the American Society for Information Science, 25(3), 17–20.

  • Scharnhorst, A. (1998). Citation–networks, science landscapes and evolutionary strategies. Scientometrics, 43(1), 95–106.

    Article  MathSciNet  Google Scholar 

  • Scharnhorst, A. (2002). Evolution in adaptive landscapes - examples of science and technology development. Discussion Paper FS II 00–302. Berlin: Wissenschaftszentrum Berlin für Sozialforschung.

  • Scharnhorst, A., & Ebeling, W. (2005). Evolutionary search agents in complex landscapes. A new model for the role of competence and meta-competence (EVOLINO and other simulation tools). arXiv:0511232. Retrieved April 16, 2010 from http://arxiv.org/abs/physics/0511232.

  • Schubert, A., & Glaenzel, W. (1984). A dynamic look at a class of skew distributions. A model with scientometric applications. Scientometrics, 6(3), 149–167.

    Article  Google Scholar 

  • Simon, H. A. (1955). On a class of skew distribution functions. Biometrika, 42(3/4), 425–440.

    Article  MathSciNet  MATH  Google Scholar 

  • Steels, L. (2001). The methodology of the artificial. Commentary on Webb, B. (2001) Can robots make good models of biological behaviour? Behavioral and Brain Sciences, 24(6), 1071–1072. Retrieved May 5 2008 from http://www.csl.sony.fr/downloads/papers/2001/steels.html.

  • Watts, D. J. (2004). The ‘New’ science of networks. Annual Review of Sociology, 30, 243–270.

    Article  Google Scholar 

  • Watts, D. J., & Strogatz, S. H. (1998). Collective dynamics of ‘small-world’ networks. Nature, 393, 440–442.

    Article  Google Scholar 

  • Weisberg, M., & Muldoon, R. (2009). Epistemic landscapes and the division of cognitive labor. Philosophy of Science, 76(2), 225–252.

    Article  Google Scholar 

  • Whitley, R. (2000). The intellectual and social organization of the sciences. Oxford: Oxford University Press.

    Google Scholar 

  • Wolpert, D. H., & Macready, W. G. (1997). No free lunch theorems for optimization. IEEE Transactions on Evolutionary Computation, 1(1), 67–82.

    Article  Google Scholar 

  • Zuckerman, H. (1977). Scientific Elite: Nobel Laureates in the United States. New York: Free Press.

    Google Scholar 

Download references

Acknowledgments

This research was supported by SIMIAN (Simulation Innovation: A Node), a part of the UK’s National Centre for Research Methods, funded by the Economic and Social Research Council.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Christopher Watts.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Watts, C., Gilbert, N. Does cumulative advantage affect collective learning in science? An agent-based simulation. Scientometrics 89, 437–463 (2011). https://doi.org/10.1007/s11192-011-0432-8

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11192-011-0432-8

Keywords

Mathematic subject classification

JEL classification

Navigation